tarukumar
Contributor

SUMMARY:
"please provide a brief summary"

TEST PLAN:
"please outline how the changes were tested"

tarukumar and others added 11 commits September 16, 2025 21:34
Signed-off-by: Tarun Kumar <[email protected]>
* New models for validation

* update qwen3 metrics. add server for distil-whisper

* update value

* update whisper-large-v3 and Voxtral model accuracy server settings

* more Voxtral server settings

* accuracy servers need to be in RedHatAI too

* update Qwen2.5-VL-7B-Instruct-FP8-Dynamic values

* try Kimi-K2 with quad

* metric value for gpt-oss-20b from a run on k8s-a100-duo

* remove empty file

Co-authored-by: Derek Kozikowski <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>